Skip to content

chore(data): refresh reddit baseline 2026-04-06#28

Closed
Sam-24-dev wants to merge 1 commit into
mainfrom
reddit-baseline-20260406-0138
Closed

chore(data): refresh reddit baseline 2026-04-06#28
Sam-24-dev wants to merge 1 commit into
mainfrom
reddit-baseline-20260406-0138

Conversation

@Sam-24-dev
Copy link
Copy Markdown
Owner

Summary

  • refresh reddit baseline data for 2026-04-06
  • prepare reddit fallback data before the weekly ETL

Validation

  • reddit_etl.py
  • sync_assets.py
  • validate_csv_contract.py
  • check_frontend_assets.py --mode strict --root .

Copilot AI review requested due to automatic review settings April 6, 2026 01:39
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates committed Reddit baseline assets for the 2026-04-06 refresh, intended to keep the frontend and fallback datasets in sync ahead of the weekly ETL.

Changes:

  • Refresh Reddit “temas emergentes”, “sentimiento frameworks”, and “intersección GitHub vs Reddit” CSV baselines (both in datos/ and frontend/assets/data/).
  • Regenerate reddit_sentimiento_public.json to reflect the updated baseline and legacy source pathing.
  • Replace Reddit history bridge JSON payloads with “missing/empty” placeholders.

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 2 comments.

Show a summary per file
File Description
frontend/assets/data/reddit_temas_history.json Replaced topics history bridge with an empty “missing” payload.
frontend/assets/data/reddit_temas_emergentes.csv Updated frontend baseline topics counts.
frontend/assets/data/reddit_sentimiento_public.json Updated public sentiment JSON payload (legacy source).
frontend/assets/data/reddit_sentimiento_frameworks.csv Updated frontend baseline sentiment CSV.
frontend/assets/data/reddit_interseccion_history.json Replaced intersection history bridge with an empty “missing” payload.
frontend/assets/data/interseccion_github_reddit.csv Updated frontend baseline intersection ranks/differences.
datos/reddit_temas_emergentes.csv Updated repo baseline topics counts.
datos/reddit_sentimiento_frameworks.csv Updated repo baseline sentiment CSV.
datos/interseccion_github_reddit.csv Updated repo baseline intersection ranks/differences.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment on lines +2 to +8
"generated_at_utc": "2026-04-06T01:39:34Z",
"dataset": "reddit_temas_emergentes",
"source_mode": "history",
"snapshot_count": 2,
"latest_snapshot_date": "2026-03-30",
"previous_snapshot_date": "2026-03-28",
"topic_count": 10,
"source_mode": "missing",
"snapshot_count": 0,
"latest_snapshot_date": null,
"previous_snapshot_date": null,
"topic_count": 0,
Copy link

Copilot AI Apr 6, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

source_mode is set to missing and latest_snapshot_date/previous_snapshot_date are null. This will fail the enforced bridge integrity gate (scripts/check_bridge_integrity.py rejects bridges with source_mode in {missing, none, ''} and requires latest_snapshot_date), and it also makes this repo unusable as a Reddit baseline candidate for the ETL fallback (scripts/restore_reddit_baseline.py requires a non-null latest_snapshot_date). Keep the last known-good history bridge content (or regenerate it) so source_mode is non-missing and latest_snapshot_date is populated (and consistent with the intersection bridge).

Copilot uses AI. Check for mistakes.
Comment on lines +2 to +11
"generated_at_utc": "2026-04-06T01:39:34Z",
"dataset": "interseccion_github_reddit",
"source_mode": "history",
"snapshot_count": 2,
"latest_snapshot_date": "2026-03-30",
"previous_snapshot_date": "2026-03-28",
"coverage_pct": 30.0,
"comparable_count": 3,
"item_count": 10,
"latest_items": [
{
"tecnologia": "Python",
"tipo": "Lenguaje",
"ranking_github": 1,
"ranking_reddit": 7,
"brecha_abs": 6,
"promedio_rank": 4.0,
"direccion": "github_favorece",
"rank_github_prev": 1,
"rank_reddit_prev": 7,
"delta_gap": 0,
"trend_direction": "estable"
},
{
"tecnologia": "TypeScript",
"tipo": "Lenguaje",
"ranking_github": 2,
"ranking_reddit": 6,
"brecha_abs": 4,
"promedio_rank": 4.0,
"direccion": "github_favorece",
"rank_github_prev": 2,
"rank_reddit_prev": 5,
"delta_gap": 1,
"trend_direction": "aumentando"
},
{
"tecnologia": "JavaScript",
"tipo": "Lenguaje",
"ranking_github": 4,
"ranking_reddit": 10,
"brecha_abs": 6,
"promedio_rank": 7.0,
"direccion": "github_favorece",
"rank_github_prev": 4,
"rank_reddit_prev": 10,
"delta_gap": 0,
"trend_direction": "estable"
},
{
"tecnologia": "Angular",
"tipo": "Framework Frontend",
"ranking_github": 2,
"ranking_reddit": null,
"brecha_abs": null,
"promedio_rank": null,
"direccion": "incompleto",
"rank_github_prev": 2,
"rank_reddit_prev": null,
"delta_gap": null,
"trend_direction": null
},
{
"tecnologia": "Go",
"tipo": "Lenguaje",
"ranking_github": 5,
"ranking_reddit": null,
"brecha_abs": null,
"promedio_rank": null,
"direccion": "incompleto",
"rank_github_prev": 5,
"rank_reddit_prev": null,
"delta_gap": null,
"trend_direction": null
},
{
"tecnologia": "Next.js",
"tipo": "Framework Frontend",
"ranking_github": 1,
"ranking_reddit": null,
"brecha_abs": null,
"promedio_rank": null,
"direccion": "incompleto",
"rank_github_prev": 1,
"rank_reddit_prev": null,
"delta_gap": null,
"trend_direction": null
},
{
"tecnologia": "React",
"tipo": "Framework Frontend",
"ranking_github": 3,
"ranking_reddit": null,
"brecha_abs": null,
"promedio_rank": null,
"direccion": "incompleto",
"rank_github_prev": 3,
"rank_reddit_prev": null,
"delta_gap": null,
"trend_direction": null
},
{
"tecnologia": "Sin especificar",
"tipo": "Lenguaje",
"ranking_github": 3,
"ranking_reddit": null,
"brecha_abs": null,
"promedio_rank": null,
"direccion": "incompleto",
"rank_github_prev": 3,
"rank_reddit_prev": null,
"delta_gap": null,
"trend_direction": null
},
{
"tecnologia": "Svelte",
"tipo": "Framework Frontend",
"ranking_github": 4,
"ranking_reddit": null,
"brecha_abs": null,
"promedio_rank": null,
"direccion": "incompleto",
"rank_github_prev": 4,
"rank_reddit_prev": null,
"delta_gap": null,
"trend_direction": null
},
{
"tecnologia": "Vue 3",
"tipo": "Framework Frontend",
"ranking_github": 5,
"ranking_reddit": null,
"brecha_abs": null,
"promedio_rank": null,
"direccion": "incompleto",
"rank_github_prev": 5,
"rank_reddit_prev": null,
"delta_gap": null,
"trend_direction": null
}
],
"source_mode": "missing",
"snapshot_count": 0,
"latest_snapshot_date": null,
"previous_snapshot_date": null,
"coverage_pct": 0.0,
"comparable_count": 0,
"item_count": 0,
"latest_items": [],
Copy link

Copilot AI Apr 6, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This bridge is being reset to source_mode: missing with snapshot_count: 0 and null snapshot dates. That violates the repo’s bridge integrity expectations (see scripts/check_bridge_integrity.py) and prevents scripts/restore_reddit_baseline.py from selecting this repo as a valid fallback baseline (it requires latest_snapshot_date to be present and match the topics bridge). Restore/regenerate the last valid intersection history bridge so source_mode is non-missing and latest_snapshot_date is populated.

Copilot uses AI. Check for mistakes.
@Sam-24-dev
Copy link
Copy Markdown
Owner Author

Superseded by PR #29, which contains the validated Reddit baseline refresh with correct frontend bridges.

@Sam-24-dev Sam-24-dev closed this Apr 6, 2026
@Sam-24-dev Sam-24-dev deleted the reddit-baseline-20260406-0138 branch April 6, 2026 04:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants